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A crucial step in the regulation of gene expression is binding of transcription factor (TF) proteins 
to regulatory sites along the DNA. But transcription factors act at nanomolar concentrations, and 
noise due to random arrival of these molecules at their binding sites can severely limit the precision 
of regulation. Recent work on the optimization of information flow through regulatory networks 
indicates that the lower end of the dynamic range of concentrations is simply inaccessible, over¬ 
whelmed by the impact of this noise. Motivated by the behavior of homeodomain proteins, such as 
the maternal morphogen Bicoid in the fruit fly embryo, we suggest a scheme in which transcription 
factors also act as indirect translational regulators, binding to the mRNA of other transcription fac¬ 
tors. Intuitively, each mRNA molecule acts as an independent sensor of the TF concentration, and 
averaging over these multiple sensors reduces the noise. We analyze information flow through this 
new scheme and identify conditions under which it outperforms direct transcriptional regulation. 

Our results suggest that the dual role of homeodomain proteins is not just a historical accident, but 
a solution to a crucial physics problem in the regulation of gene expression. 


I. INTRODUCTION 

Cells control the concentration of proteins in part by 
regulating transcription, the process by which mRNA 
molecules are synthesized from the DNA template. Cen¬ 
tral to the regulation of transcription are the “transcrip¬ 
tion factor” (TF) proteins that bind to specific sites along 
the DNA and enhance or repress the expression of nearby 
genes. Perhaps surprisingly, many TF molecules are 
present at very low concentration, and even at low to¬ 
tal copy number [T]. While it has been appreciated for 
many years that low concentrations of biological signaling 
molecules must lead to significant noise levels [5], direct 
measurements of the fluctuations in gene expression have 
become possible only in the past fifteen years 0- 

Pathways for the regulation of gene expression can be 
seen as input-output devices, with information flowing 
from input control signals (TF concentrations) to out¬ 
put behaviors (number of synthesized protein molecules). 
While “information” usually is used colloquially in de¬ 
scribing biological systems, the mutual information be¬ 
tween input and output provides a unique, quantitative 
measure of the performance of these systems HIS]. In the 
context of embryonic development, for example, the in¬ 
formation (in bits) carried by gene expression levels sets 
a limit on the complexity and reproducibility of the body 
plans that can be encoded by these genes |^. 

Decades of work on neural coding provide a model for 
the use of information theory in exploring signaling pro¬ 
cesses in biological systems HIT]. To exploit this con¬ 
cept, as a first step it is necessary to estimate the various 
information theoretic quantities from data on real sys¬ 
tems, and for genetic regulatory networks that has been 
achieved only very recently. There are estimates of the 
mutual information between the concentration of a tran¬ 
scription factor and its target gene expression [HI [5], the 


information that expression levels of multiple genes carry 
about the position of cells in the developing fruit fly em¬ 
bryo [Mill], and the information that gene expression 
levels provide about external signals in mammalian cells 
[HIIH]. As a second step, we need to understand the¬ 
oretically how the various features of the systems—the 
architecture of signal transmission, the noise levels, the 
distribution of input signals—contribute to determining 
information transmission. In qualitative terms, the noise 
levels set a limit to information flow given a fixed max¬ 
imum signal level, and thus understanding information 
transmission is intimately connected to the question of 
how the cell can maximize the information conveyed by 
a limited number of molecules produced and transported 
stochastically [TiHM] ; completing the circle, this problem 
is directly analogous to the “efficient coding” problem in 
neural systems [25]. As emphasized in Ref. [5], infor¬ 
mation theoretic ideas can thus be used as tools for the 
quantitative characterization of biological systems, but 
there is also the more ambitious goal of building a theory 
in which the behavior of real neural, genetic, or biochem¬ 
ical networks could be derived, quantitatively, from the 
optimization of information flow. 

Transmitting maximum information with a limited 
number of molecules requires regulatory networks to em¬ 
body strategies for minimizing the effects of noise. Im¬ 
portantly, there are (at least) two contributions to the 
noise |26j . and optimal networks find a balance between 
these. The more widely appreciated component of noise 
in transcriptional regulation comes from the stochastic 
birth and death of the synthesized protein and mRNA 
molecules which we refer to as “output noise.” But 
there is also noise at the input of the regulatory process, 
from the random arrival of transcription factor molecules 
at their binding sites. We can think of transcriptional 
regulation as mechanisms for sensing the concentration 


2 


of TFs, connecting the analysis of “input noise” to the 
broader problem of limits on biochemical signaling and 
sensing first studied in the context of bacterial 

chemotaxis [5] • Changing the shape of input-output rela¬ 
tions, both through cooperativity and through feedback, 
changes the balance between input and output noise, thus 
rendering the optimization of information flow a well- 
posed problem even with very simple physical constraints 
on the total mean number of molecules [TH1418) . 


Central to any account of noise reduction is the effect 
of averaging. In the context of transcriptional regulation, 
there is averaging over time as molecules accumulate, av¬ 
eraging over expression levels of multiple genes that are 
regulated by the same TF, and averaging over space as 
molecules diffuse between neighboring cells or nuclei, e.g., 
in a developing embryo [TH or organoid [3D] . In 

the (typical) case where one TF targets multiple genes, 
there is a regime where information transmission is opti¬ 
mized by complete redundancy in the response of these 
targets, and another regime in which the concentrations 
for activation or repression of the targets are staggered so 
as to “tile” the dynamic range of inputs m- But, even as 
we consider networks with increasing numbers of targets, 
the optimal strategy is to insert the additional genes into 
the high concentration end of the input range, leaving 
the lower part of the dynamic range largely unused for 
information transmission. In effect, the input noise at 
low concentrations is too high for reliable signaling, and 
the solution is simply to avoid this regime. 


Here we explore a scenario that allows transcription 
factors to recover access to the low end of their dynamic 
range. Since these proteins bind to DNA, it is plausi¬ 
ble that they could also bind to mRNA, thereby regu¬ 
lating translation; this is known to happen in the large 
class of homeodomain proteins [4TV[^ . Intuitively, each 
mRNA molecule could act as an independent sensor of 
the TF concentration, and averaging over these multiple 
sensors could reduce the input noise and thereby allow 
for more effective information transmission at low TF 
concentrations. To develop this intuition, we first con¬ 
sider a model of the “direct transcriptional regulation” 
(DTR) scheme, in which a TF concentration is read out 
by M binding sites on the same promoter (Section [lT|; 
we subsequently generalize the model to a more compli¬ 
cated “indirect translational regulation” (ITR) scheme, 
in which the redundant readout function is served by 
M cytoplasmic mRNA molecules (Section |III[ ). We com¬ 
pare the two regulation mechanisms by computing the 
maximum information flow in each as a function of the 
input noise magnitude and other determinants of the in¬ 
formation flow (Section |rv|. We conclude by discussing 
a biologically relevant example from early Drosophila de¬ 
velopment (Section [vj). 


II. AVERAGING OVER NEIGHBORING 
REGULATORY REGIONS IN DIRECT 
TRANSCRIPTIONAL REGULATION 


The intuition behind the arguments of this work is that 
a cell can extract more information from low concentra¬ 
tions of transcription factors by averaging over multiple 
binding regions. We expect that this will be realized by 
having the multiple binding regions on different mRNA 
molecules. As a motivating exercise, however, we can 
imagine that there are many regions for binding of the 
transcription factor at a single target near the gene being 
regulated, and that the expression of this gene depends 
on the average of the occupancies of these regions (see 
schematic in Fig. [^); there are hints that such non- 
cooperative regulation by a cluster of binding regions 
may be realized in some cases |44j. We expect that, with 
averaging over M binding regions, we should find a ^/M 
reduction in noise levels, and our goal here is to exhibit 
this explicitly, as well as to understand the conditions for 
this reduction to be achieved. These results will provide a 
guide to the more complex case of “indirect translational 
regulation” (ITR), introduced in Sec. Ill The calcula- 


tional framework we use here is based on our previous 

work HIIDHIH]- 

We write the expression level of the single target gene 
as g, and if expression is controlled by the average of 
multiple nearby regulatory regions then the dynamics are 
of the form 


djl 

dt 


= r 
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( 1 ) 


where r is the maximal rate of synthesis, 1/t is the rate 
at which the gene products are degraded, and ^ is a 
Langevin noise source (zero-mean white noise). In this 
model there is a single transcription factor, at concentra¬ 
tion c, that controls expression. We assume r to be the 
longest time scale in the problem, thus setting the aver¬ 
aging time for all noise sources in the regulatory pathway. 
As described more fully in Refs. [aiisHis], we can think 
of the regulatory mechanism as propagating information 
from c to g, and this information transmission is a mea¬ 
sure of the control power achieved by the system. 

In the simplest case, each region harbors just one bind¬ 
ing site, and the contribution of that site to the activa¬ 
tion of gene expression is determined by its equilibrium 
occupancy hi € [0,1]; then we have 

/,(c)=hi(c)= (2) 

c + Ki 

where Ki is the binding constant or affinity of site i 
for the transcription factor. Alternatively, each region, 
corresponding to a regulatory sequence along the DNA, 
could be a tight cluster of binding sites that act cooper¬ 
atively, so that 


/i(c) 


r,Hi 


rHi 


+ K^. 
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where Hi is the Hill coefficient describing the cooperativ- 
ity. Note that in this parameterization, we can describe 
activators and repressors by the same equation, using 
positive and negative Hi, respectively. 

The noise term ^ should include many different micro¬ 
scopic effects. There is noise in the synthesis and degra¬ 
dation of the gene product (output noise), and there is 
noise in the arrival of the transcription factor at its tar¬ 
get site (input noise). As in Refs. [TSHU], we describe 
the output noise as a birth-death process, and subsume 
several complexities by assuming that we are counting 
“independent events” without making a detailed com¬ 
mitment about their nature (e.g., whether the mRNA or 
protein molecules are independent, or if the truly inde¬ 
pendent events are bursts of transcription [H]); we will, 
however, consider these aspects in more detail in the ITR 
model (Sec. Ilf). 

For the input noise, there is a minimum level set by the 
Berg-Purcell limit [H UHl 1311 ISi] , which is equivalent to a 
variance in the concentration of the transcription factor 


c,(in) 


Dr 


-$B(n(c)), 


(4) 


where Dc is the diffusion constant of the transcription 
factor, £c is the effective linear dimension of the binding 
region, r is the integration time over which noise is av¬ 
eraged, and the “occupancy factor” $ 5 ( 71 ( 0 )) is a func¬ 
tion of the average occupancy that depends on molec¬ 
ular details, with B denoting the number of binding 
sites per regulatory region. In particular, for a single 
binding site with equilibrium occupancy hi, we expect 
$1 = (1 — hi)“^ [3U[33]; for a cluster of sites in the limit 
as their number grows large, such that a single site is 
never fully saturated, $b —>■ $00 = 1 [HISS]- 

We can cast both input and output noise into the 
Langevin form (cf. Eq. [^, but we know from Refs. [T51 - 
nni that, so long as they are not too large, these noise 
sources provide additive contributions to the variance of 
g. We can find the effect of the input noise by “propa¬ 
gating errors” through the mean input-output relations, 
and then add to the output noise: 



The first term is the Poisson output noise, with the vari¬ 
ance equal to the stationary mean, which can be com¬ 
puted from Eq. 0: 


M 

= ( 6 ) 

^ i=l 

Note that since fi € [0,1], the maximum mean number 
of output molecules is A^max = rr. 

Let us now assume, for simplicity, that all regulation 
functions fi are identical: all Hi = H, all Ki = K, and 
hence all fi{c) = f{c) = n{c). The total noise in gene 
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FIG. 1. (Color online) Schematic comparison of direct 
transcriptional regulation (DTR) and indirect tran- 
lational regulation (ITR) schemes. (A) In direct tran¬ 
scriptional regulation (DTR), activator (or repressor) TFs, 
depicted as green squares and present at concentration c, in¬ 
teract with (potentially multiple, not necessarily identical) 
TF binding regions to activate (repress) expression of the 
regulated gene g. (B) In the indirect translational regula¬ 
tion (ITR) scenario, input molecules (green squares) bind to 
mRNA m of protein y (red chain) to make the mRNA un- 
accessible for translation (gray oval). Translation can pro¬ 
ceed from unbound mRNA molecules, giving rise to proteins 
y (red stars). These proteins act as repressors (or activators) 
for gene g; the overall mapping from c to gr is thus activating 
(repressing) in both scenarios. 


expression in a model with M identical binding regions 
then reads: 



(7) 


Introducing, consistently with our previous work, a di¬ 
mensionless concentration unit cq = Afmax/L^c^cT, and 
measuring expression levels g in units of maximal induc¬ 
tion Amax = tt, we obscrve that the mean expression is 
simply g = f{c), and the noise can be written as 


1 

_ , '1>B 

1- 

1 
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K^dcJ 


If the input concentration c has a limited dynamic range, 
i.e., c G [0, C], where C = Cmax/co is the maximal allowed 
concentration of the input in units of cq , the relative im¬ 
portance of the two noise terms is set by C. For C » 1, 
it is possible to regulate the gene such that the input 
noise contribution [second term of Eq. (|^] is negligible 
compared to the output noise [first term of Eq. ([^]. For 
C 1, the input noise is dominant and the output noise 
is negligible, unless M is large. The balancing of these 
noise terms has been explored in detail in our previous 
work [TdlllQ) . 

Alternatively, the total noise at the output from Eq. Q 
can be mapped to an equivalent noise at the input 
through the slope of the input-output relation g'{c), 
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C 


N„ 
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Equations contain two differences compared to 
the single input/single output case reported in Ref. [E]. 
First, the “occupancy factor” is introduced by a re¬ 
finement of the expression for the input noise [311 135] : 
this change will not qualitatively influence our conclu¬ 
sions. Moreover, as mentioned above, the significance of 
this correction decreases as the number of binding sites 
per region increases [31]. Second, the factor 1/M mul¬ 
tiplying the input noise contribution suggests that the 
input noise can be decreased by averaging over multiple 
binding regions for the transcription factor. 


One possible reaction scheme for indirect translational 
regulation consists of the following system of equations: 


= Tm - m — k+cm + k-b 


— = fc+c m — k-b — 


dm 

dt 

db 

dt 

dy 

dt 

dt 


1 


= r„m — 


= rf{y/^) - -g. 

r 


( 10 ) 

( 11 ) 

( 12 ) 


(13) 


If the transcriptional regulatory apparatus really were 
driven to strongly suppress the input noise, e.g., by a fac¬ 
tor of order 10, this would necessitate large M, and it is 
hard to imagine how 10^ — 10^ binding regions could be 
packed into a linear regulatory section on the DNA. One 
difficulty with this is that there is no plausible molecular 
machinery which could read out the average occupancy 
of so many regions. The other, more fundamental diffi¬ 
culty is that, due to close packing on the DNA, such reg¬ 
ulatory regions would interact and thus fail to provide 
independent concentration measurements, likely negat¬ 
ing the apparent benefits of input noise averaging. This 
effect is well known since the original work of Berg and 
Purcell in the context of chemoreception [2]. 


In the following section we will show that translational 
regulation implements an input noise reduction mecha¬ 
nism that is conceptually identical to that of M bind¬ 
ing regions, while automatically removing the two asso¬ 
ciated problems discussed above. We will compare the 
noise reduction in the “indirect translational regulation” 
(ITR) mechanism to the “direct transcriptional regula¬ 
tion” (DTR) case, which we define as the simple scheme 
using M = 1 regulatory element with noise given by 
Eqs. ([§§). 


III. INDIRECT TRANSLATIONAL 
REGULATION 


In the indirect translational regulation (ITR) scenario, 
the gene g is not regulated directly, but through an in¬ 
termediate step. Let us assume that TF c translationally 
represses the mRNA m of a protein whose copy number 
we will denote by y; this protein acts as a repressing (or 
activating) TF for the output protein g, as depicted in 
Fig. [Tfl. As a result, the end transformation of inputs 
c to outputs g is again activating (or repressing), and 
can be compared to the respective direct transcriptional 
regulation pathway (see Fig. [^. 


Here, the mRNA of the intermediary gene y is produced 
at rate and degraded at rate t“^. It can be bound 
by the input transcription factor at rate fc+c, and un¬ 
bound at rate fc_. The variable m tracks the number of 
unbound mRNA from which translation can proceed; b 
tracks the repressed (bound) mRNA number. Transla¬ 
tion of the unbound mRNA occurs at rate and the 
protein is degraded with rate . Finally, y controls the 
expression of g, as the input c does in the DTR scenario, 
through a regulatory function /: g proteins are expressed 
with maximal rate r, and degraded with rate I/t, by as¬ 
sumption the slowest time scale in the problem. Here to, 
b and y are given as absolute molecule counts, c is given 
in concentration units. The function /(•) is defined to 
take concentration as input to parallel the DTR scenario; 
the expression level of the intermediary gene y therefore 
must be divided by the relevant reaction volume, H, when 
inserted into /. Equations ( [I0][T3 ) still have to be sup¬ 
plemented by the associated Langevin noise terms; we 
analyze the noise in detail below. 

By solving Eqs. ( I0][l3 ) in the steady state, we obtain 
the average levels of signaling molecules: 


I -I- — 


y = VyTym 
g = TT, 


/(!)■ 


(14) 


Here, Kc = , and in the limit of fast binding and 

unbinding (k-Tm ^ I) this reduces to Kc ~ fc_/fc+, 
which is akin to the familiar form for the dissociation 
constant. As before, we define A/nax = rr to be the max¬ 
imum number of molecules at the output. Analogously, 
let M — TruTm be the steady state number of mRNA, 
either active (unbound by c) or repressed (bound by c), 
i.e., M = fh + b. 

To compute the noise in gene expression at steady 
state, we consider the following noise sources: (i) shot 
noise due to the production and degradation of g; (ii) 
shot noise due to the production and degradation of the 
protein y, (iii) shot noise due to production and degrada¬ 
tion of the mRNA to; (iv) diffusion noise due to random 
arrival of y molecules at the promoter of g; and (v), dif¬ 
fusion noise due to random arrival of c molecules at the 
mRNA of y. Only two of these noise sources [(i) and 
(v)] arise from directly analogous processes in the DTR 
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scheme. The additional sources reflect the increased com¬ 
plexity of the ITR scheme, suggesting that ITR will only 
be beneficial over DTR when the reduction in the input 
noise of c (v) is large enough to compensate for the effect 
of the new noise sources. 

As in previous work (Refs [TMT!?] . but see also [20)1. 
this analysis neglects switching noise, i.e., the noise 
due to stochastic transitions between different occupancy 
states of the promoter [msMi]. One reason for this is 
that such noise contributions depend on the exact molec¬ 
ular mechanisms operational at the promoter, which we 
do not know in detail. The other reason is that the 
noise contributions analyzed here represent the physi¬ 
cal bounds due to finite number of molecules. At suf¬ 
ficiently low concentrations, which are the focus of in¬ 
terest here, these noise contributions will overwhelm the 
switching noise; more generally we can imagine that cells 
have evolved mechanisms that minimize these adjustable 
sources of noise, leaving the physically inevitable noise 
sources to dominate. We will now analyze the effect of 
the different noise contributions on the steady state vari¬ 
ance of the output g in detail. 

Birth-death noise sources (i)-(iii). To correctly com¬ 
pute the birth-death (shot) noise contributions to the 
total noise at the output, it is instructive to consider the 
propagation of arbitrary shot noise sources in a generic 
signaling cascade of the form: 

dxi 1 ^ 

^ = r2h{xi) - —X2 + 6 
at To 


fi'T 1 

^=r^fn{x„-l)--Xr.+^n 
dt Tn 


(15) 


where the shot noise spectra are 


(^j(t)Cfc(t')) = 2 r- ^XjSjkS{t - t') 


(16) 


in the steady state, with Xj = (xj) the stationary mean. 
Assuming that 3> Tj for all j n, it can be shown 
by linearizing and Fourier transforming Eqs. (151 (see 
Appendix © that the total variance in to a good 
approximation, is where 


'n^j 


n 2 


n 

q=j+l 


X Xi 


(17) 


Equation ( [T7| ) is intuitive to interpret: shot noise entering 
the cascade at step j has variance equal to the mean, Xj, 
which gets filtered by the temporal averaging, Tj/xn, over 
what is ultimately the slowest timescale in the problem. 


Tn = T, and is finally propagated through all subsequent 
stages of the signaling pathway, given by the gain factors 
and slopes of the input-output relations [5^ . 

Diffusion noise sources (iv) and (v). Here we first note 
that the contribution of diffusion noise sources to the 
variance in the output can be generically written as 


a 


2 

g<^x 







(18) 


where x is the concentration of the diffusing species, $ 3 , is 
a function of the internal state of the regulatory region to 
which X is binding, £ 3 , is its linear extent, and the deriva¬ 
tive acts on the steady-state transformation between x 
and the mean output g, given by Eqs. (14). 


For noise source (iv), the diffusive species is y, and 
the target is the transcription factor binding site control¬ 
ling the expression of g. The relevant input-output rela¬ 
tion through which the noise is propagated is = 

rTf'{y/il); to allow for flexible regulation that can imple¬ 
ment different input-output curves /(y/H), we will later 
assume that the expression is effected by many binding 
sites, resulting in ~ 1 . 

For noise source (v), the diffusive species is c, and the 
targets are mRNA m. Since each mRNA molecule acts 
as an autonomous “detector” for the concentration c, the 
relevant “receptor occupancy” is the probability of a sin¬ 
gle mRNA to be bound and thus repressed by c; denoting 
the average single-mRNA occupancy as fhi = rh/M, we 
can write the diffusion noise for each single mRNA as: 


^2 _ 

’X'lrii ~ 


/ dmi\ 

1 ’ 

f I dm\ 


1 dc J 


\M dc ) 

Dcm^c'^ 


(19) 


where Dcm = Dc Dm, accounts for the fact that in the 
ITR scenario also the detectors can be mobile; usually we 
can expect Dc Dm, such that Dcm — Dc, and in the 
following we thus set Dcm = Dc- As in the direct regu¬ 
lation model, we can add up the diffusion noise for the 
identical but independent c-detectors to obtain the diffu¬ 
sion noise in the total mRNA population: 

Propagating this noise through the downstream input- 
output relations finally yields the expression for noise 
source (v): 




dg 

dm 


1 dm^ ^ 


^ M dc 


c 


Dc 


( 20 ) 


Assuming a single TF binding site per mRNA, we can 
expect = (1 — b/M)~^ [32], which we use in our sub¬ 
sequent calculations. 

Assembling all noise sources together. Applying the 
above considerations to the ITR regulation sheme de¬ 
fined by Eqs. ( 10|[T3 ), we can write down the steady state 
variance in the output as: 
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n 


TTyV^ 




DylyTVt 


DLt 




(21) 

M dc ) V dm I 


(i) 


(ii) 


(iv) 


(v) 


Let us now choose a set of units that is natural for the 
ITR scheme and consistent with the DTR scenario. As 
before, we measure the output g in units of A^max = tt 
such that it falls into [0,1], and we measure the concen¬ 
tration c in units of cq, 


Co 


N 

max 

DcicT 


( 22 ) 


In direct analogy, we choose a concentration unit for pro¬ 
teins y: 


2/0 


N 

’max 

T) P T 


(23) 


Since the binding sites and diffusion constants for TFs 
y and c are in principle different, these units could be 
different, but we will later assume them to be similar. 
Let us also define: 


2/max — 


^yo ’ 


(24) 


as the maximum (dimensionless) concentration of pro¬ 
teins y expressed from a single mRNA. 


Binding and unbinding of c to mRNA defines the first 
nonlinearity of the problem; denoting the average occu¬ 
pancy of a single mRNA molecule by h{c), we have 


h{c) 


Kg 

Kg + C^ 


(25) 


where c and Kg = (1 + k-Tm)/(k+TmCo) are both dimen¬ 
sionless. Note that h{c) = fh/M. The second nonlinear 
function is /(y), determining the output expression level 
of g (as in the DTR scenario); its argument is a dimen¬ 
sionless concentration of y, measured in units of j/o- We 
will explore the space of functions with a Hill form 


/(y) = yH^ (26) 

where K is also measured in units of y^, and the Hill 
coefficient H roughly corresponds to the number of y 
binding sites on the promoter of g. Here again the sign 
of H determines whether y is an activator or repressor 
to g. In this work, we will focus on the case H < 0 
(meaning that y activates g, while itself being repressed 
by the upstream input c, resulting in overall negative 
regulation of g hy c), and compare to the DTR model in 
which g is repressed by c as well [iL < 0 in Eq. ([^]. 


We can rewrite the noise in Eq. (21) as the effective noise at the input. 


cr 


2 

a 


( df y / d{y/yo) Y / dm \ 
\d{y/yo)) V dfh ) \dc) 


_2 _ 2 ]\/f2£f2-Lf2 2 

^g — 2/max-^ / " ' 


(27) 


After rearranging terms and writing all copy numbers and concentrations in the new units defined above, we obtain 






1/11 

M2 /'2 h'^ y. 


2 

max 


(i) 


F h 
' MhF ' 


1 h 

M /l'2 y^^ 

~iK) 


m‘ 

(v) J 


(28) 


where 


F=—{l + ryT^), (29) 

ry 

and / and its derivative are evaluated at y/yo = 2/maxth = 2 /max Af/i(c). As before, the input concentrations can 
vary across the range c G [0, C], where C = Cy^axlco is the dimensionless maximal concentration of the input. If gene 
expression rates for the target and intermediary proteins were similar, r ~ r^, then F « 1 + ryTm, which is reminiscent 
of the Eano factor due to the “burst size” TyTm, the number of proteins of y expressed on average from one mRNA; we 
will therefore refer to F by “Eano factor” in the following. In a modified variant of the ITR model where mRNA are 
not continuously created and degraded to maintain an average of M copies, but are present at a fixed total number 
of M copies, F = rjry « 1. 
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It is useful to remind ourselves of the corresponding 
result in the DTR case, which is Eq. with M = 1, 


a 


2 

c 


1 





(30) 


We see that term (i) of the ITR case is equal to term (i') 
in the DTR case, although this identification requires us 
to propagate the inputs through more layers of response 
in the ITR, hence a more complex expression. Similarly, 
term (v) of the ITR case, representing diffusive noise for 
c-molecules binding to mRNA to, is directly analogous 
to the diffusion noise term (v') in the DTR case, but 
ITR reduces the input noise variance by a factor of M. 
While Eq. (281 appears complicated, we can nevertheless 
estimate the relative magnitudes of different noise sources 
and assess their relevance. 

Let us first compare the relative magnitude of the two 
input-type noise sources. The scale of the ITR noise com¬ 
ponent due to finite number of y molecules (iv) relative 
to the input noise in c (v) is 


(v) _ h ^ 2/max 

TVW — 2/max ~ 

(iVj h <Py C 


(31) 


We expect that $ and regulatory functions h are of or¬ 
der unity, and that the natural scale of the concentra¬ 
tion c is given by C, the maximal dimensionless input 
concentration; then the scale of the derivative of h is 
h' ~ leading to the final result. The importance of 
this term thus depends on the comparison of the maxi¬ 
mal concentration of the intermediary y proteins and the 
input proteins c. Clearly, if the intermediary proteins of 
y are present at very low copy numbers, their diffusion 
noise will become limiting, and the ITR scheme will be 
ineffectual. 

The scale of input noise (v) relative to the components 
of the ITR noise due to birth-death processes, (ii)-f (iii), 
is given by 


(v) h'^ ^ c 1 

(ii) -b (iii) “ FC' 


(32) 


As we argue above, we should expect values of E > 1. 
The importance of noise sources (ii)-l-(iii) thus depends 
on the scale of F relative to C, and in the regime of 
low input concentration, C ^ I, noise sources (ii)H-(iii) 
will be negligible for models with small transcriptional 
bursting for y proteins, i.e., when F ^ 1. 

Einally, we can assess the relative scale of the input 
noise (v) with respect to the output noise (i): 


FI being the Hill coefficient of /. As a result, the depen¬ 
dence of the noise term (i) on i/max exactly cancels out; 
similarly, the M-dependence of term (i) cancels out ex¬ 
actly. This is to be expected: by increasing M, one can 
average away the input noise, but the magnitude of the 
output noise can not be reduced. The same is true for 
Eq. where the output noise contribution is not di¬ 
vided by M. This term is thus important as it must 
become limiting when M grows large, with the relevant 
scale being set by H^/MC ~ I. The quadratic scaling 
with H shows that steep regulation curves strongly am¬ 
plify the noise on the input side. 


IV. COMPARING OPTIMAL INFORMATION 
FLOW IN THE TWO SCHEMES 


To compare the regulatory power of direct transcrip¬ 
tional regulation (DTR) and indirect translational regu¬ 
lation (ITR), we ask how much information can be trans¬ 
mitted through each scheme. If noise in gene expression 
is Gaussian, as we assumed here, then the response of the 
regulatory pathway is fully characterized at steady state 
by the distribution P{g\c) = Q{g;g{c),ag{c)), where Q is 
a normal distribution with the mean and standard devi¬ 
ation that can be computed from the Langevin equations 
Eqs. ( flpp^ . The mutual information, J(c;g), between 
the regulator concentration, c, and the downstream ex¬ 
pression level, (?, is then given by: 

H9',c) = J dcPc{c) J dg P{g\c)\og 2 ^^^^, (34) 


where Pg{g) = J dc Pc(c)P(gjc). Mutual information, a 
non-negative number measured in bits, tells us how pre¬ 
cisely the input c determines the output g, given that 
the noise places limits to the fidelity of this control. This 
quantity still depends on the distribution of input TE 
levels, Pc{c), experienced by the regulatory pathway. It 
is possible to find the optimal distribution P*{c) that 
maximizes the information, and the maximum achievable 
information is then referred to as the (channel) capacity 
HH]. P*(c) is tuned to the noise properties of the reg¬ 
ulation process, favoring the use of input concentrations 
at which the regulatory element responds with smaller 
noise over those inputs where noise is higher. 

While finding the optimal input distribution and the 
corresponding maximal information I is difficult in gen¬ 
eral, in the case of small noise, Ug <?; I, we previously 
derived a simple formula for the capacity [S]: 


I{c]g) = log 2 



= loga Z + - log2(Afn 


(35) 


(l) 

(i) 


= M 


h '^ /'2 
/ 




MG' 


where 

(33) 


Here, the regulation functions / and h are again of order 
unity, but the derivative of / has a scale of H/(My^ay^h), 


"=r^- 

Jo crpc) 

Z = = l)l\/2TTe 


(36) 

(37) 
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FIG. 2. (Color online) Optimal capacity of direct transcriptional regulation (DTR, black), compared to indirect 
translational regulation (ITR, red), as a function of the maximal input concentration C. (A) Optimal capacity 
log 2 Z VS. C for various total amounts of translationally targeted mRNAs, M. Here and in subsequent panels we fix Amax, 
the maximal output copy number, to a reference value Amax = 1 and show the resulting capacity; larger values of Amax simply 
shift all capacities upwards by an additive amount, as in Eq. ( |35[ ). The strength of “ITR noise” sources, (ii) + (iii) + (iv) of 
Eq. (28 1, is set by (Fano factor) F = 1 and (maximal concentration of the intermediary protein) j/max = 10. For M > 1, ITR 
outperforms DTR at low C, but at high C DTR can still reach higher capacities if M is not sufficiently large (e.g., dashed red 
line for M = 10). At M = 1 (dotted red line), the ITR scheme cannot benefit from input noise averaging, yet the intermediary 
regulatory steps still contribute the “ITR noise” absent in DTR, causing DTR to be superior to ITR at all C. (B) Capacity 
curves for different values of M in (A) collapse when plotted against the product of maximal input concentration and the 
number of mRNA targets, MC, as predicted by the scaling relation in Eq. (33l. Increasing input noise by lowering C thus can 
be compensated for by increasing M. The collapse is not perfect at high MC and curves for different M saturate at different 
capacity values because the strength of “ITR noise” is not negligible and their effect on capacity depends on M. 


and tTc is given either by Eq. ([^ for the case of direct 
transcriptional regulation, or by Eq. (28) for the case 


of indirect translational regulation. Z plays the role of 
the normalization constant in the distribution over in¬ 
puts that maximizes I{c]g)] the optimal distribution is 
P*(c) = The simple dependence of capacity 

on Aniax in Eq. (|3^ follows because A^ax only enters the 


expression as a multiplicative prefactor to ctc- 


The capacity given by Eqs. (35 36) still depends on 
the regulatory parameters. We view the maximum con¬ 
centration of input molecules, C, not as a parameter but 
as a constraint; in the ITR case there is also a constraint 
on the number of mRNA molecules, M, and, less impor¬ 
tantly, on the statistics of translation, captured by F. In 
the DTR scheme, the parameters that we can adjust in 
order to optimize information transmission are the dis¬ 
sociation constant (AT) and the Hill coefficient (H) of 
the regulatory function / in Eq. ([^. Eor ITR, we have 
the same two parameters determining the properties of 
the regulatory function / in Eq. (d), plus an extra pa¬ 
rameter, Kc, the dissociation constant for the repression 
function h(c). Since our focus is on the regime where 
input noise is limiting, C <C 1, we fix ?/max and F such 
that “ITR noise” sources, relating to intermediary com¬ 


pounds, i.e., the mRNA and protein of y, are not dom¬ 
inant. The analysis of noise terms from the preceding 
section suggests that choosing F = 1 and ?/max = 10 will 
ensure that noise in the ITR scenario is dominated by 
the input noise in c [(v)] and the output noise due to 
production and degradation of g [(i)]. In this regime, 
the performance of the complex ITR regulatory scheme 
should approach the direct regulation using M regulatory 
elements, Eq. ©■ 


A. Optimal solutions 

We mapped the optimal capacity as a function of C 
for the DTR and ITR scenarios at various values for M, 
with the results shown in Fig.[^ Eor each point on the ca¬ 
pacity curves, information was maximized with respect 
to the regulatory parameters ({A, A} in case of DTR, 
{A, A, Ac} in the case of ITR). As expected, increas¬ 
ing M clearly enhances the capacity by additionally sup¬ 
pressing dominant input noise at C < 1 relative to direct 
regulation. For C > 1, the performance of ITR vs. DTR 
depends on the choice of i/max and A, which determine 
when additional noise sources specific to the ITR scheme 
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C = 1000, F = 1 
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FIG. 3. (Color online) Scaling of the optimal capacity with “ITR noise” parameters t/max and 1/F. The optimal 
capacity logj Z for different fixed values of the mRNA number M in the ITR model is plotted against “ITR noise” parameters 
properly rescaled by M, to observe the compensation between translational noise components and the input noise. (A) For the 
diffusion noise due to the intermediary protein y, the relevant parameter is M x ymax, where j/max is the maximal concentration 
of y. (B) For the shot noise due to the expression of intermediary mRNA and protein y, the relevant parameter is M x (l/F), 
where l/T is the inverse Fano factor. In both cases, we set the remaining noise sources to be as small as possible: C = 1000 
and F = 1 for (A), C = 1000 and ymax = 1000 for (B). A perfect collapse in (A), comparable to that in (B), could only be 
achieved for Fano factors F <C 1, which are biologically unrealistic. 


become important. 

At very small C ~ 10“^, the input noise clearly limits 
capacity in the DTR scenario. As we switch over to the 
ITR scenario with M = 10, we expect the (dominant) in¬ 
put noise variance to drop by a factor of 10, leading to an 
increase in /(c; g) of log 2 -s/lO « 1.7 bits, roughly equal to 
the observed difference between the DTR curve and the 
ITR curve for M = 10. As M is increased, this scaling 
breaks down because the reduced input noise stops be¬ 
ing the sole factor limiting information transmission, and 
the capacity curves flatten as a function of C, saturat¬ 
ing towards the limit where the transmission is limited 
only by output and ITR noise. We draw attention to the 
magnitude of these effects: the capacity of real regula¬ 
tory elements is in the range of one to several bits [8] , so 
that differences on the order of one bit are huge. 

The value of C where the capacity begins to saturate 
depends on M, because increasing M is equivalent to in¬ 
creasing C as predicted by the scaling relation of Eq. (33). 
To make this explicit we plotted the same capacity data 
against MC instead of C, as shown in Fig. [2J3. As ex¬ 
pected, the curves collapse in the low-MC regime, where 
input noise is dominant, and start to saturate at similar 
MC values. However, they do not reach the same plateau 
at saturation, because the ITR noise contributions, (ii) -I- 
(iii) -I- (iv), are non-negligible. To see that these contribu¬ 
tions can be traded off against the decrease in input noise 
set by M as well, we varied parameters F and i/max at 
fixed C = 1000 in Fig. such that only one noise source 


remained non-negligible compared to the others. When 
the resulting capacity is plotted against the appropriately 
rescaled versions of the parameters, Mj/max and M/F, re¬ 
spectively, we again see a collapse of the capacity curves 
for different M, as predicted in Section Ell In Fig. 
the collapse is almost perfect, because y-input noise (iv) 
dominates whereas the respective other two noise sources 
are negligible, and the plateau thus is purely set by out¬ 
put noise. In Fig. [3^ this is not possible because under 
realistic conditions the Fano factor F > 1, meaning that 
the ITR noise cannot be completely eliminated; however, 
we verihed that a perfect collapse can also be achieved 
here by setting F to unrealistically low values F <C 1. 


What do the optimal regulatory curves look like? Fig¬ 
ure 1^4 shows that at low C = 0.1, where capacity en¬ 
hancement by translational regulation is largest, the low¬ 
ered input noise in the ITR model allows for markedly 
steeper effective input-output curves g{c), especially for 
M ^ 1. At larger C, the difference in capacity between 
ITR and DTR decreases, as expected from Fig. and 
the input-output curves converge towards similar effec¬ 
tive Hill coefficients in both models, but ITR has a sig¬ 
nificantly lower midpoint input concentration than DTR. 
These observations are confirmed by a more systematic 
analysis, presented in Figs.|4j3 and C, where we plot the 
optimal (effective) transition point K*g and the optimal 
Hill coefficient H*, respectively, as functions of C. 
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FIG. 4. (Color online) Optimal regulatory curves. (A) Optimal regulatory curves, g{c), for C = 0.1 (left), (7=1 (middle) 
and (7 = 10 (right), and different values of M in the indirect translational regulation (ITR) model (colored curves): M = 10 
(top), M = 100 (middle), M = 1000 (bottom). The direct transcriptional regulation (DTR) model is plotted in black for 
reference. Colors correspond to the vertical lines in Fig. Here, F = 1 and j/max = 10. A/, the difference in capacity between 
ITR and DTR in bits, is shown in each panel. (B, C) Optimal regulation threshold, K*g, and the optimal Hill coefficient, H*, 
that characterize the regulatory curves in (A), shown as a function of C. Since the overall regulation is repressive in either 
scheme. Hill coefficients are negative. In the DTR model, K^s = K\ for ITR, K^s = (Ac/A)Mi/max — Ka- 


B. Comparison at equal resources 

In the previous section we compared the optimal op¬ 
erating points for the two regulation scenarios indepen¬ 
dently, neglecting the resource cost associated with reach¬ 
ing the corresponding optima. However, indirect transla¬ 
tional regulation evidently consumes more resources than 
direct regulation. The extra resource cost is set by M, 
which determines the cost of the mRNA for the interme¬ 
diary protein y, as well as j/max) which determines the 
cost of the y protein itself. While it is difficult to con¬ 
vert the cost of mRNA and cost of protein into the same 
“currency,” we can nonetheless compare the costs of the 
ITR and DTR schemes in terms of the total number of 
mRNA molecules M. alone. 

In the DTR scheme, a good estimate for this cost is 
M. = iVmax(ff), where {g) = / g{c)P*{c)dc is the ex¬ 
pected number of independent output molecules in re¬ 
sponse to the optimal distribution of inputs, P*{c). In 
the ITR scheme, this cost is increased by the number 


of mRNA molecules needed to implement transcriptional 
regulation, M, such that M. = iVmax( 5 )+Af. In this case, 
{g) is implicitly a function of M, because different val¬ 
ues of M lead to different noise levels and consequently 
different optimal input distributions P*{c). 

When we fix the total resource cost Ai in both scenar¬ 
ios, the information in the direct scenario is given by: 


I{c; g) = log 2 ^ -b - log 2 N„ 


log2 ^ + 2 


M 

(9) ■ 
(38) 


For translational repression, the (optimized) information 
reads 

1 M- M* 

I (c; g) = log 2 Z{M*) + - log 2 -- , (39) 

where M* is the value of M £ [0, A4] that optimizes 
the capacity, reflecting the optimal allocation of available 
resources, A4, between the translational mechanism that 
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FIG. 5. (Color online) Direct transcriptional regulation (DTR) vs. indirect translational regulation (ITR) at 
equal resources. (A) Comparison of the maximal capacity as a function of C between indirect translational regulation (ITR, 
red curves) and direct transcriptional regulation (DTR, black curves) scenarios, for three values of total resource cost (total 
mRNA number), At. Here, r/max = 10, F — 1. Compared to Fig. ITR stops being beneficial compared to DTR at lower C. 
(B) Capacity difference between the ITR and DTR models as a function of C and At. The thick red line marks the (C,M) 
values that lead to equal capacities; in the regime above the red line ITR, in spite of intermediary regulatory steps, is beneficial 
over DTR. 


reduces input noise (favoring large M) vs. the reduction 
of the output noise (favoring large Amax, i-e. smaller M). 
M* can be found by first optimizing log 2 Z{M) for fixed 
M and then optimizing /(c; g) over M in the second step. 

Figure compares the information capacities of the 
ITR (red lines) and DTR (black lines) scenarios for three 
fixed values of the total mRNA number (“cost”) Ad 
(10,100,1000). Two effects can be observed: First, in 
both scenarios the capacities increase markedly with Ad; 
this is expected, because increasing Ad reduces the over¬ 
all noise in both scenarios. Note, however, that the gain 
in capacity upon increasing Ad is larger in the ITR sce¬ 
nario (compare red curves) than in the DTR scenario 
(compare black curves), in particular at low C. This is 
also expected because the ITR scheme is more efficient in 
suppressing input noise; in accordance, optimal mRNA 
values M* at low C are higher than at high C (data not 
shown). Second, due to the additional resource require¬ 
ments of the ITR scheme, it stops being beneficial over 
the DTR scheme already at lower C, compared to the 
situation where constrained resources are not taken into 
account in Fig. 

We mapped, systematically, the conditions under 
which ITR becomes beneficial over DTR. Figure 
shows the capacity difference /itr — .^dtr as a function 
of the two main factors that influence capacity, C and 
Ad. The thick red line marks the combinations (C, Ad) 
that lead to equal capacity in both models, i.e. it summa¬ 
rizes the crossing points between ITR and DTR curves in 
Fig.[^ (thick red points); above the line ITR is superior 


to DTR, below DTR yields higher capacities. ITR is ben¬ 
eficial at low C (i.e., high input noise), and as C increases, 
ITR remains beneficial only for resource-intensive regu¬ 
latory schemes whose cost Ad grows sufficiently quickly 
with C. We note, once again, the large size of these in¬ 
formation differences, in bits. 


V. DISCUSSION 

An efficient regulatory pathway will respond to varia¬ 
tion in its signal across the full input dynamic range. But 
what if a part of this dynamic range is associated with 
very high input noise, as is the case for transcription 
factor signals at low concentration? Then, the pathway 
can either avoid responding to those signals completely, 
thereby sacrificing some of the bandwidth, or it can uti¬ 
lize reaction schemes that are able to reduce the impact 
of high input noise. In this paper we showed that in¬ 
direct translational regulation (ITR) is one such regula¬ 
tory scheme. The intuitive reason for the advantage of 
translational regulation is that every single one of the M 
mRNA molecules that is being translationally regulated 
by the input signal effectively acts as a “receptor” for 
the input concentration; this results in an M-fold more 
efficient averaging of input noise compared to the case 
of direct regulation, where a single DNA binding site is 
acting alone as a receptor. Consistent with this intu¬ 
ition, our results show that when input noise is dominant 
(C ^ 1), translational repression with high M can pro- 
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vide large increases in channel capacity relative to direct 
regulation. 

Increases in capacity, however, do not come for free. 
First, the ITR scheme involves additional reaction steps 
and intermediary regulatory molecules; as a consequence, 
new noise sources, which we call “ITR noise,” are intro¬ 
duced. Only when these sources are sufficiently small 
relative to the input noise set by C, does the transla¬ 
tional scheme yield measurable benefits. Second, lower¬ 
ing ITR noise sources also incurs metabolic costs associ¬ 
ated with producing the required intermediary molecules; 
this means that a realistic comparison between the direct 
scheme and the translational repression scheme is rele¬ 
vant only when carried out at comparable resources. We 
explored these effects in detail to show when translational 
repression is beneficial over direct regulation. 

In this work we have analyzed only one particular reac¬ 
tion scheme for translational regulation, but clearly many 
variations on the same idea are possible. We examined 
several of these possibilities, and found consistent results. 
First, the transformation of the input c into the gene ex¬ 
pression level g in the DTR as well as the ITR scheme can 
be either repressive or activating. We find no qualitative 
differences between the two schemes, and thus present 
only the repressive scheme. The channel capacities of 
the two schemes, separately optimized, differ by ~ O.I 
bits, with activation performing slightly better at C < 1, 
and vice versa. While the shapes of the regulatory curves 
must be different by construction, the general trends— 
sharper regulatory curves already for low C in the ITR 
scheme, lower dissociation constants for high C in the 
ITR scheme—for the activators are identical to those of 
the repressors. 


Second, we examined how the precise description of 
the input noise form (<I>b in Eq. @) influences our re¬ 
sults. In one limit, we can set <I>b = correspond¬ 
ing to a single binding site, while in the opposite limits 
we can set = $ 00 , corresponding to a large num¬ 
ber of binding sites [5^ [311 l^- Tracing through the 
full numerical analysis in both cases, all differences are 
small, and largely confined to the regime in which both 
C and M are small. The input noise depends on diffusion 
constants, and we assumed equal diffusion constants for 
the input and the intermediary molecules, and further 
that translationally regulated mRNA are immobile. Re¬ 
laxing these assumptions changes the effective diffusion 
rates in the diffusion noise terms; in particular, highly 
mobile mRNA might further contribute to input noise 
reduction in the ITR model, Eq. (191. Lastly, we had 
to make assumptions about how the translationally reg¬ 
ulated mRNA are produced and how they interact with 
the inputs. Apart from a change in ITR noise magni¬ 
tude, the model where translationally regulated mRNA 
are present at a molecule count behaves identically to the 
model we studied, where mRNA are continuously made 
and degraded. We have, however, not analyzed more 
complicated interaction schemes between the input and 
the mRNA, e.g., one in which the mRNA would sequester 


the input transcription factors. 

What are the limits to noise reduction using trans¬ 
lational regulation? In the pedagogical example of the 
promoter with M regulatory sites, the sites would be 
packed very closely on the DNA and thus the molecule 
“detected” by one site would have a high likelihood of 
rebinding to a nearby site, providing a statistically cor¬ 
related, rather than independent, sample of the concen¬ 
tration in the bulk. In this regime, the decrease in in¬ 
put noise variance would be slower than 1/M, and as 
M —> 00 , the input noise would saturate to a bound 
determined by the linear dimension of the cluster of reg¬ 
ulatory sites HUHlIiS]. The same argument will ulti¬ 
mately apply to M mRNA molecules of the ITR sce¬ 
nario. There, each mRNA harbors a binding site of size 
£c 1 — 10 nm, but the mRNAs typically are sepa¬ 
rated by distances at the nuclear scale d ~ 1 — 10 /rm. 
Roughly, we can expect that input readouts will be in¬ 
dependent and the ITR mechanism effective so long as 
M < d/ic ~ 10^ — 10"^, putting M in the range that 
we examined and where it appears biologically plausible. 
With these M, we find significant increases in capacity 
of 0.8 — 1.8 bits at C ^ 0.1 — 1, with optimal regula¬ 
tory curves using high Hill coefficients \H*\ > 4, thereby 
accessing the full output dynamic range. Moreover, the 
molecular mechanism for integrating the readout over M 
such “receptors” is simple diffusion of the intermediary 
protein y, unlike in the pedagogical example of M regu¬ 
latory sites on the DNA where we know of no plausible 
molecular integration mechanism. 

Are there biological examples of such indirect transla¬ 
tional regulation scheme? During early Drosophila mor¬ 
phogenesis, one of the primary transcription factors that 
drive the anterior-posterior (AP) patterning of the em¬ 
bryo is Bicoid, which is established in a graded, exponen¬ 
tially decaying profile along the AP axis [53H57] . Bicoid 
activates a number of downstream genes directly in the 
anterior and the middle region of the embryo where its 
concentration is highest. Absolute concentrations of bi¬ 
coid are estimated to be ^ 55 nM at a maximum, falling 
to ^ 8 nM at the midpoint of the embryo [531135]). At the 
posterior end of the embryo, where its concentration is 
low and hard to detect quantitatively using the imaging 
methods available. Bicoid translationally represses caudal 

mRNA |331I351 [531130] . 

The caudal mRNA molecules are produced by the 
mother, who deposits them into the egg with a uniform 
distribution along the AP axis. As they are bound and re¬ 
pressed by Bicoid, while the free ones get translated into 
Caudal protein, the embryo develops a new gradient of 
Caudal protein with high concentration at the posterior 
and low in the anterior. Caudal acts much like the in¬ 
termediary protein y in our model, becoming a regulator 
of patterning genes in the posterior, with c representing 
the Bicoid concentration. While such a scheme involving 
an intermediary maternal mRNA and protein appears 
wasteful at first glance, here we have shown that it may 
provide a substantial benefit for information transmis- 
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sion at low input concentration, i.e. in posterior regions 
of the embryo. Although we are not familiar with any 
direct measurement of caudal inRNA copy numbers, the 
copy numbers for the gap genes [SI] (and also bicoid |S^ 1 
are consistent with the range of M ^ 10^ — 10^ mRNA 
per nucleus. More generally, it is interesting to note that 
bicoid is just one member of a larger homeodomain TF 
family whose members have the ability to act both as a 
transcriptional as well as translational regulators mi. 
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Appendix A: Shot-noise propagation in a generic 
signaling cascade 

Let us assume the following signaling cascade, de¬ 
scribed by a system of Langevin equations for the signal¬ 
ing species ccj, in which shot noise is generated at each 
level j G [l,..,n] and (for j < n) propagated into the 
copy number of species Xj+i via a regulatory function 
fj+iixj). 

dxi 1 

=ri -a;i -f Cl 

at Ti 


(Al) 


1 

^=r„/„(a;„_i)--a;„+C„ , 

dt Tn 

In the steady state, for the noise powers of the Langevin 
noise sources Cj we have 

={Rj + <5,fcd(f - t') = ‘^S.kSit - t') , 

V u / u 

(A2) 

where i?i = ri and Rjyi = rjfj(xj-i), and xj = (xj) 
denotes the stationary mean. For our purposes we want 
to compute the overall variance in the last component 

Linearizing around the means xj via xj = Xj + Sxj 
and fj{xj-i) ~ fj{xj-i) + f'j{xj-i)6xj-i, and Fou rier- 
transforming for t ^ lo allows us to convert Eqs. (Al) 


into the following equation system for the Fourier- 


transformed fluctuations 5xj: 


— iujSxi = - 5xi -I- Cl 


icoSXn — Xnf^(^Xn—i^SXn—i dXri Cn (-^^) 
For the Fourier-transformed noise powers Cj we get 

= 2T-^XjSjk- (A4) 


The system defined by Eqs. (A3) can be solved alge¬ 
braically by succesively inserting the solution for Sxj into 
the equation for Sxj+i: 


(5ii = 


Cl 


r, ^ — iu! 


6xn = — 


n 




. ^ — iw ^ T„ ^ — iuj 

1=1 1 q=3+i ^ 


(A5) 


where we abbreviate /' = f'^(xq-i). 

We can now obtain the variance by integrating over 
the noise power spectrum S'„(w) = (SxnSx^) of the fluc¬ 
tuations in component n: 

o-n = / = J ^i^S:n6x*„) 


_ fduj^^ ‘^■XjT- 

“ / dU 


n 


271 ^ T- ^ -I- a;2 Tq 

1=1 7 g=i-i-i 1 


-2 


(A6) 


where we recall that for mixed indices (Cj(‘^)Cfc('^)) = 0. 

Isolating the n-term of the sum, and introducing the 
dimensionless integration variable w = T„a;, we can fur¬ 
ther write: 


Hr) Xr, 


dw 1 

TT 1 + 


n—1 

E 

1=1 ■ 


dw 


7 V T, 


1-f 


_ 111 




g^j+1 1 + 


-^w 


(A7) 


The integrand of the second integral can be written as: 


k=j 



7‘fc 

- W 

1 

Tn 


-1 


1 f 

1 -)- y;2 n ( 

fe=i \ 



Tk 

- W 

1 

Tn 


-1 


(AS) 


The leading factor (l-|-w^)“^ only has significant contri¬ 
butions when I re I < 1, and will be suppressed (together 
with the whole integrand) for |r(;| 3> 1. Assuming that r„ 
is the longest timescale of the problem, i.e. : Tn Xk, 
in the relevant regime |ui| < 1 all factors except for the 
leading one will be ~ 1. Thus, to a good approximation: 
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